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ABSTRACT 

Display advertising normally charges advertisers for every 
single ad impression. Specifically, if an ad in a webpage has 
been loaded in the browser, an ad impression is counted. 
However, due to the position and size of the ad slot, lots 
of ads are actually not viewed but still measured as im¬ 
pressions and charged. These fraud ad impressions indeed 
undermine the efficacy of display advertising. A perfect 
ad impression viewability measurement should match what 
the user has really viewed with a short memory. In this 
paper, we conduct extensive investigations on display ad 
impression viewability measurements on dimensions of ad 
creative displayed pixel percentage and exposure time to 
find which measurement provides the most accurate ad im¬ 
pression counting. The empirical results show that the most 
accurate measurement counts one ad impression if more 
than 75% of the ad creative pixels have been exposed for 
at least 2 continuous seconds. 
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INTRODUCTION 

Display advertising, i.e. displaying ad creative images on 
the webpages, is one of the mainstream advertising for¬ 
mats for the current popular Internet advertising. During 
Q4 2013 in US, the display advertising accounted for $3.7 
billion, i.e. 30% of total revenues [6]. Moreover, the re¬ 
cently emerged real-time bidding (RTB) techniques act as 
a new paradigm for display advertising [5]. Unlike the tra¬ 
ditional ad inventories buying by bundles, RTB enables the 
advertisers to perform the impression-level bidding based 
on the evaluation of the individual ad display opportunities. 
Such evaluation makes use of various real-time information 
such as the context data, user cookies and interest segments 
[10]. As a result, display advertising, especially its RTB 
part, brings the advertisers a higher retum-on-investment 
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(ROI), and thus more and more budget is allocated into dis¬ 
play advertising [6]. 

Most display ads are charged based on pay per view (PPV), 
i.e. each individual ad impression is charged [4, 8]. The 
current measurement of ad impressions is simply imple¬ 
mented by the loading of the ad creatives. That is to say, 
when a webpage is loaded in the browser, each ad em¬ 
bedded in the page HTML is immediately counted as one 
impression. However, it is possible that some ad creatives 
are not viewed by the user, or even have not been exposed 
in the viewport (i.e. browser window in the screen). We 
refer such ad impressions as fraud impressions. Because 
of the fraud impressions, publishers could earn some easy 
money by trying to add many ads at the bottom of the web¬ 
pages. However, from a long-term perspective, the fraud 
impressions are definitely harmful for display advertising 
eco-system. Firstly, the advertisers have to pay for some 
ads never displayed, which seriously reduces the ROI. Sec¬ 
ondly, the fraud impressions act as the noisy data in the 
machine learning training stage of impression evaluator in 
RTB platform. For example, a click-through rate (CTR) es¬ 
timator is much important in RTB display advertising [10]. 
Its training data consists of the features of each ad impres¬ 
sion with the label showing whether the user clicked the ad 
or not. If an ad impression is logged but the ad was actually 
not shown in the user viewport, such data is a noise which 
should reduce the prediction performance of the trained 
CTR estimator. With more wasted budget and inaccurate 
impression evaluation, the ROI of display advertising is re¬ 
duced, which could possibly prevent the advertisers from 
allocating more budget into display advertising. As a re¬ 
sult, it is much important to find an effective ad impression 
viewability measurement to get rid of such problems. 

A perfect ad impression measurement should just match 
what the user has really viewed with at least a short mem¬ 
ory. Firstly, the ad creative should be really shown in the 
viewport. Secondly, in order to have at least a short impres¬ 
sion of the ad, the ad creative should be exposed in front 
of the user for a piece of time. Therefore, the two impor¬ 
tant factors we care about for measuring the ad impression 
viewability are: (i) pixel percentage, i.e. the percentage of 
pixels of ad creative shown in the viewport, and (ii) expo¬ 
sure time, i.e. how long the ad creative has been shown in 
the viewport. 

In this paper, from a human-computer interaction perspec¬ 
tive, we carefully study the relationship between the user’s 
short memory on the ads with the two factors: pixel per¬ 
centage and exposure time. We track the pixel percentage 



and exposure time of each ad creative using jQuery, based 
on which we develop 11 ad impression measurements. The 
user study experiments involve 20 participants, where each 
of them is asked to freely read some webpages with the 
tracked display ads. Then we check the match degree be¬ 
tween the user’s recalled ad impressions and the counted 
impressions from each measurement. Our empirical study 
shows that the pixel percentage and exposure time are both 
important factors for the user to remember the ads. And 
particularly, the exposure time significantly influences ad 
recall confidence. Specifically, the empirically best mea¬ 
surement counts an ad impression if 75% pixels of the ad 
creative have been shown for at least 2 continuous seconds. 

RELATED WORK 

Since the origin of online advertising, the researchers have 
been seeking a way to answer whether or not the user has 
viewed an ad in one webpage. In [1], the relationship be¬ 
tween the ad creative animation and users’ recognition were 
studied but the authors found such relationship was not sig¬ 
nificant. The authors in [3, 2] leveraged the eye-tracking 
techniques to study the users’ attention area on the screen 
and check how likely the user truly viewed a particular ad. 
Different users and webpages had much different high at¬ 
tention areas [2]. However, such eye-tracking techniques 
are impractical to be used in the production. The most prac¬ 
tical and straightforward method is to define the pixel per¬ 
centage and exposure time as the thresholds of measuring 
a viewed impression across all the webpages and users. In 
2013, Google announced that the advertisers were charged 
only for the viewed ad impressions, where an ad was con¬ 
sidered as viewed only if the pixel percentage was no less 
than 50% and the exposure time was no less than 1 sec- 
ond^ Later in 2014, lAB published a new standard for 
viewable display impressions, where the pixel percentage 
threshold was still 50% while the exposure time threshold 
was 1 second for web display ads, and 2 seconds for video 
ads^. It should be noted that such criteria is a result of bal¬ 
ancing the benefit of advertisers, publishers and intermedi¬ 
aries. In our work, we study the ad impression viewability 
measurements purely from the human-computer interaction 
perspective. 

SYSTEM DESIGN 

The goal of our system is to investigate how the displayed 
pixel percentage and the exposure time influence the users’ 
ad recall, and which impression viewability measurement 
best matches the users’ remembered ad. 

Pixel Percentage Tracking 

The displayed pixel percentage for rectangle ad creative in 
the viewport can be calculated by the displayed height per¬ 
centage times the displayed width percentage. Therefore, 
we tracked the bounds of browser’s viewport and each ad 
creative. 

Figure 1 shows the relationship of the variables. In the 
webpage coordinates, the upperleft point is the origin point. 
The lower place means the higher y-axis value and the right 
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Figure 1. Advanced pixel percentage tracking diagram 

place means the higher x-axis value. Specifically, we cal¬ 
culate four ratios capped by 1: 

Top = min(1, (bounds.bottom - viewport.top) / height) 
Bottom = min(1, (viewport.bottom - bounds.top) / height) 
Left = min(1, (bounds.right - viewport.left) / width) 

Right = min(1, (viewport.right - bounds.left) / width) 
Pixel% = Top X Bottom X Left X Right 

In Figure 1, Top=Left=Right=l, Bottom=0.6, thus 
the pixel percentage is 60%. Given an impression measure¬ 
ment with the pixel percentage threshold 50%, the measure¬ 
ment will count this ad impression. Note that when any of 
the four factors is negative value, the entire ad creative is 
outside of the viewport, thus the pixel percentage is calcu¬ 
lated as zero. 

Exposure Time Tracking 

The exposure time is associated with a pixel percentage 
threshold. For example, if the pixel percentage is 50%, only 
after half pixels have been shown in the viewport does the 
tracking system start to count the exposure time. If we do 
not want any pixel percentage threshold, just set it as 0%. 
If the measured exposure time has surpassed the predefined 
threshold, e.g., 2 seconds, then the measurement counts this 
ad impression. 

Specifically, we used the tick counts based methods to cal¬ 
culate the exposure time. For example, for the measure¬ 
ment of 50% pixel percentage and 2 seconds exposure 
time, the tick counter will start to track the time once the 
pixel percentage meets 50%. Then tick counter calls the 
pixel percentage tracking algorithm every 0.1 second for 20 
times. Every time the pixel percentage tracking algorithm 
checks whether the current pixel percentage is no less than 
50%. If it returns false, the tick counter will restart the 
counting. If the tick counter counts up to 20, the exposure 
time and pixel percentage thresholds are both reached, thus 
the measurement counts this ad impression. 

EXPERIMENT SETTING 
Participants 

20 participants (10 male), students and staff at ANON-UNI, 
were recruited to take part in our user study. The average 
age was 23.2 (SD = 1.4). The majority spent 3 to 7 hours 
everyday using computer and had more than 8 year experi¬ 
ence of using Internet. 


^http://www.bbc.co.uk/news/business-25356956 

^http://www.iab.net/iablog/2014/03/viewability-has-arrived- 

what-you-need-to-know-to-see-through-this-sea-change.html 


Materials 

We created 5 webpages with an article per page on the topic 
of sports, food, game, joke and gossip, respectively. We 




































Home Page 1 


Fast-food 'linked to childhood asthma and eczema' 

Advertisements 

1 1 

Eating fast food three a week may 

J more than so countries suggests poor diet 
may be te blame for rising levels of these 
allergy-related Those who ate 

1 I ! 


risked severe asthma, eczema and itchy, watery eyes. Ealing plenty of fruit appears to be protective, ! 

Thorax journal reports. Fast food often contains high levels of saturated- and trans-fatly acids, v\rfiich 


researchers. ^ 


In the study, children in their early teens who ate three or more weekly servings of fastfood had a 39% i 

increased risk of severe asthma. 

ONLINE ONLY 

Six- and seven-year-olds had a 27% increased risk. 


rhinoconjunctivitis by between 11 % and 14%. 

OFF ANY PIZZA 

Generally, people with asthma do not have to followa special diet 

In some cases, certain foods, such as cow's milk, eggs, fish, shellfish, yeast products, nuts, and some 

Atmeoupctcfe 
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beneficial effect on asthnta therefore Asthma UK advises people with asthma to eat a healthy, 
balanced diet including five portions of fruit or vegetables every day, fish more than twice a week, and j 

pulses more than once a week." 

News from BBC: "http:/Avww.bbc.co.uk/news/health-2100965^^ j 



Figure 2. Example test webpage. 


then randomly downloaded 60 ad creative images with uni¬ 
form size of 300 x 250 pixels^. These ads covered vari¬ 
ous product categories, including electronic devices, food, 
sports, games, radio, shoes, education, gamble etc. We 
allocated 6 ads for each page, and deployed the impres¬ 
sion tracking algorithm with all 11 measurements to each 
ad. In order to remove the position bias of each ad on the 
webpages, the ads were shuffled during page loading. The 
viewport on a sample webpage is shown in Figure 2. 


Procedure 

Prior to starting the assessed part of the experiment, each 
participant completed the first questionnaire about their de- 
mographical information. For the next five minutes'^, each 
participant was asked to read the webpages freely without 
telling them the time limit for browsing the webpages and 
there would be questions about ads afterwards. After the 
participant completing browsing the webpages, they were 
presented with the second questionnaire. This question¬ 
naire consisted of 60 ads with 30 appeared in the webpages 
and other 30 never shown. Participants were told to se¬ 
lect the ads which they had just seen, and to give their sub¬ 
jective assessment for level of confidence in remembering 
each selected ad. The level of confidence was elicited on 
7-point Likert scales with the anchor 1 (Not confident) - 
7 (Strongly confident). Finally, they were presented with 
the post-experimental questionnaire relating to the level of 
understanding webpages’ content as well as the degree of 
their attention to the ads while reading the webpages. 


Scoring 

The counting of each ad impression is a bi-classiflcation 
problem. For each user-ad pair, the measurement judged 
whether the user had viewed this ad. And the labels were 
given by the user feedback on the second questionnaire. We 
used precision, recall and FI score to evaluate the impres¬ 
sion counting performance of each measurement. Precision 
value is the fraction of counted relevant ad impressions over 


^According to lAB, this is one of the most standard display ad 
sizes. http://www.iab.net/wiki/index.php/Ad_banner 
"^We choose such time length according to the users’ average stay 
time against the word number on webpages according to [9]. 
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Figure 3. Average FI and boxplot of FI score for each measurement. 
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Figure 4. Boxplot of FI score, precision and recall against each pixel 
percentage and exposure time. 


all the counted ones. Recall is the fraction of counted rele¬ 
vant ad impressions over all the user selected valid ad im¬ 
pressions. FI score is calculated based on precision and 
recall: FI = 2PR/{P F R). 

We also used the FI score to evaluate the user’s selection on 
valid impressions and article topics. The user might select 
some ads which were actually never occurred in the web¬ 
pages. Also the user would select the wrong article topics. 

EMPIRICAL RESULTS 
Algorithm Performance 

The FI score and its distribution in box plot [7] for 
each measurement as well as the measurements’ pixel 
percentage threshold and exposure time interval are pre¬ 
sented in Figure 3. We observed that measurement 6 
(pixel > 75%, time G [2,4)) and measurement 9 (pixel > 
75%, time G [4,+oo)) provided the highest average FI 
score and median FI score. This suggested that an ad was 
probably viewed by the user if at least 75% of its pixels had 
been shown in the viewport for no less than 2 seconds. 

In Figure 4, we further analysed the distribution difference 
of FI, precision and recall against different pixel percent¬ 
ages and exposure time respectively. From the result we 
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Figure 5. Numbers of pixel percentage and exposure time for the user Figure 6. Users ad selection precision against their attention on ads, 

selected ads, with the user ad selection confidence on different expo- content and content precision, 

sure time. 


found that generally with the higher pixel percentage or 
the longer exposure time, the recall decreased whereas the 
precision increased. This finding is intuitive because the 
more ad display surface and the longer exposure time lead 
to the higher user ad recall. On the other hand, the higher 
pixel percentage and exposure time thresholds usually fil¬ 
tered out some of the truly viewed ads, leading to the lower 
recall. For the FI score distribution, 75% pixel percentage 
threshold and [2, 4-oc) exposure time leaded the highest FI, 
which was consistent with the individual measurement per¬ 
formance discussed above. 

User Analysis 

We then investigated participants’ self-reports of level of 
confidence in ad recall. Figure 5 shows the numbers of 
each pixel percentage, exposure time interval and the par¬ 
ticipants’ ad selection confidence against the exposure time. 
We observed that 97.3% of the user selected ad impressions 
were displayed with 100% pixels, which meant the users 
almost just noticed on the ads that had been fully shown 
in the viewport^. On the exposure time dimension, most 
of the ad impressions were exposed for more than 4 sec¬ 
onds and in such case the user’s ad selection confidence 
was much higher than that with shorter exposure time. A 
one-way ANOVA was conducted on confidence to compare 
the effect of exposure time. There was a significant effect 
of exposure time. F(3,145) = 4.042, p = .009. Post hoc 
comparisons using the Tukey HSD test indicated that the 
mean score for the time G [4, +oo) condition was signifi¬ 
cantly higher than the time G [2,4) condition, p = .023 . 
However, the time G [2,4) condition did not significantly 
differ from the time G [0,2) condition p = .965 , and the 
time G {0} condition p = .999. 

It is possible that some user-selected ad impressions were 
in fact not shown in the webpages at all. In Figure 6, we 
studied the precision of user ad selection against the users’ 
reported levels of attention on ads, webpage content and 
their webpage topic selection precision. The relationships 
between ad selection precision and all these three variables 
were positive. Particularly, the last two subfigures in Fig¬ 
ure 6 indicate an unintuitive finding that when the users pay 
more attentions on the webpage content, they will also un¬ 
intentionally notice more ads. 

CONCLUSIONS 

In this paper, from the human-computer interaction per¬ 
spective, we studied the contribution of ad impression 

^However, a quarter of the counted impressions with 100% pixels 
only had less than 2 seconds exposure time, which corresponded 
to the measurement 4 with low performance. As a result shown in 
Figure 4, 75% pixel overall worked better than 100% pixel. 


viewability from the display pixel percentage and exposure 
time for an ad display opportunity. The results showed that 
with no less than 75% of pixels being shown in the viewport 
for at least 2 continuous seconds, the ad impressions were 
effectively and sufficiently counted. Furthermore, compar¬ 
ing the dimensions of pixel percentage and exposure time, 
the study indicated that the exposure time acted as a more 
significant factor for the users’ ad recall confidence on the 
ad impressions than the pixel percentage. In the future 
work, we plan to investigate more factors which could make 
difference on users’ ad recall, such as the ad creative sizes, 
ad slot positions and ad image formats. And we plan to de¬ 
ploy our impression viewability measurement on a partner 
commercial display advertising platform to check whether 
the effectively “viewed” ad impressions would improve the 
advertising performance in a long term. 
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